NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

LazyLog: A New Shared Log Abstraction for Low-Latency Applications

https://doi.org/10.1145/3694715.3695983

Luo, Xuhao; Bhat, Shreesha G; Hu, Jiyu; Alagappan, Ramnatthan; Ganesan, Aishwarya (November 2024, ACM)

Shared logs offer linearizable total order across storage shards. However, they enforce this order eagerly upon ingestion, leading to high latencies. We observe that in many modern shared-log applications, while linearizable ordering is necessary, it is not required eagerly when ingesting data but only later when data is consumed. Further, readers are naturally decoupled in time from writers in these applications. Based on this insight, we propose LazyLog, a novel shared log abstraction. LazyLog lazily binds records (across shards) to linearizable global positions and enforces this before a log position can be read. Such lazy ordering enables low ingestion latencies. Given the time decoupling, LazyLog can establish the order well before reads arrive, minimizing overhead upon reads. We build two LazyLog systems that provide linearizable total order across shards. Our experiments show that LazyLog systems deliver significantly lower latencies than conventional, eager-ordering shared logs.
more » « less
Full Text Available
Automatic Reliability Testing for Cluster Management Controllers

Sun, Xudong; Luo, Wenqing; Gu, Jiawei Tyler; Ganesan, Aishwarya; Alagappan, Ramnatthan; Gasch, Michael; Suresh, Lalith; Xu, Tianyin (July 2022, Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI'22))

Modern cluster managers like Borg, Omega and Kubernetes rely on the state-reconciliation principle to be highly resilient and extensible. In these systems, all cluster-management logic is embedded in a loosely coupled collection of microservices called controllers. Each controller independently observes the current cluster state and issues corrective actions to converge the cluster to a desired state. However, the complex distributed nature of the overall system makes it hard to build reliable and correct controllers – we find that controllers face myriad reliability issues that lead to severe consequences like data loss, security vulnerabilities, and resource leaks. We present Sieve, the first automatic reliability-testing tool for cluster-management controllers. Sieve drives controllers to their potentially buggy corners by systematically and extensively perturbing the controller’s view of the current cluster state in ways it is expected to tolerate. It then compares the cluster state’s evolution with and without perturbations to detect safety and liveness issues. Sieve’s design is powered by a fundamental opportunity in state-reconciliation systems – these systems are based on state-centric interfaces between the controllers and the cluster state; such interfaces are highly transparent and thereby enable fully-automated reliability testing. To date, Sieve has efficiently found 46 serious safety and liveness bugs (35 confirmed and 22 fixed) in ten popular controllers with a low false-positive rate of 3.5%.
more » « less
Full Text Available
Exploiting Nil-Externality for Fast Replicated Storage

Ganesan, Aishwarya; Alagappan, Ram; Arpaci-Dusseau, Andrea; Arpaci-Dusseau, Remzi (October 2021, Proceedings of the Symposium on Operating Systems Principles)
null (Ed.)
Do some storage interfaces enable higher performance than others? Can one identify and exploit such interfaces to realize high performance in storage systems? This paper answers these questions in the affirmative by identifying nilexternality, a property of storage interfaces. A nil-externalizing (nilext) interface may modify state within a storage system but does not externalize its effects or system state immediately to the outside world. As a result, a storage system can apply nilext operations lazily, improving performance. In this paper, we take advantage of nilext interfaces to build high-performance replicated storage. We implement SKYROS, a nilext-aware replication protocol that offers high performance by deferring ordering and executing operations until their effects are externalized. We show that exploiting nil-externality offers significant benefit: for many workloads, SKYROS provides higher performance than standard consensus-based replication. For example, SKYROS offers 3x lower latency while providing the same high throughput offered by throughput-optimized Paxos.
more » « less
Full Text Available
Reasoning about modern datacenter infrastructures using partial histories

https://doi.org/10.1145/3458336.3465276

Sun, Xudong; Suresh, Lalith; Ganesan, Aishwarya; Alagappan, Ramnatthan; Gasch, Michael; Tang, Lilia; Xu, Tianyin (June 2021, In Proceedings of the 18th Workshop on Hot Topics in Operating Systems (HotOS-XVIII))
null (Ed.)
Modern datacenter infrastructures are increasingly architected as a cluster of loosely coupled services. The cluster states are typically maintained in a logically centralized, strongly consistent data store (e.g., ZooKeeper, Chubby and etcd), while the services learn about the evolving state by reading from the data store, or via a stream of notifications. However, it is challenging to ensure services are correct, even in the presence of failures, networking issues, and the inherent asynchrony of the distributed system. In this paper, we identify that partial histories can be used to effectively reason about correctness for individual services in such distributed infrastructure systems. That is, individual services make decisions based on observing only a subset of changes to the world around them. We show that partial histories, when applied to distributed infrastructures, have immense explanatory power and utility over the state of the art. We discuss the implications of partial histories and sketch tooling for reasoning about distributed infrastructure systems.
more » « less
Full Text Available
Strong and Efficient Consistency with Consistency-aware Durability

Ganesan, Aishwarya; Alagappan, Ram; Arpaci-Dusseau, Andrea; Arpaci-Dusseau, Remzi (February 2021, ACM transactions on storage)
null (Ed.)
We introduce consistency-aware durability or Cad, a new approach to durability in distributed storage that enables strong consistency while delivering high performance. We demonstrate the efficacy of this approach by designing cross-client monotonic reads, a novel and strong consistency property that provides monotonic reads across failures and sessions in leader-based systems; such a property can be particularly beneficial in geo-distributed and edge-computing scenarios. We build Orca, a modified version of ZooKeeper that implements Cad and cross-client monotonic reads. We experimentally show that Orca provides strong consistency while closely matching the performance of weakly consistent ZooKeeper. Compared to strongly consistent ZooKeeper, Orca provides significantly higher throughput (1.8--3.3×) and notably reduces latency, sometimes by an order of magnitude in geo-distributed settings. We also implement Cad in Redis and show that the performance benefits are similar to that of Cad’s implementation in ZooKeeper.
more » « less
Full Text Available
Strong and Efficient Consistency with Consistency-Aware Durability

Ganesan, Aishwarya; Alagappan, Ramnatthan; Arpaci-Dusseau, Andrea; Arpaci-Dusseau, Remzi (January 2020, 18th USENIX Conference on File and Storage Technologies (FAST '20))

We introduce consistency-aware durability or CAD, a new approach to durability in distributed storage that enables strong consistency while delivering high performance. We demonstrate the efficacy of this approach by designing cross-client monotonic reads, a novel and strong consistency property that provides monotonic reads across failures and sessions in leader-based systems. We build ORCA, a modified version of ZooKeeper that implements CAD and cross-client monotonic reads. We experimentally show that ORCA provides strong consistency while closely matching the performance of weakly consistent ZooKeeper. Compared to strongly consistent ZooKeeper, ORCA provides significantly higher throughput (1.8 – 3.3×), and notably reduces latency, sometimes by an order of magnitude in geo-distributed settings.
more » « less
Full Text Available
Getting more performance with polymorphism from emerging memory technologies

https://doi.org/10.1145/3319647.3325826

Narayanan, Iyswarya; Ganesan, Aishwarya; Badam, Anirudh; Govindan, Sriram; Sharma, Bikash; Sivasubramaniam, Anand (May 2019, 12th ACM International Conference on Systems and Storage)

Full Text Available

Search for: All records